Simultaneous Statistical Inference for Epigenetic Data
نویسندگان
چکیده
Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed at various genetic loci simultaneously. Hence, in order to be able to draw valid conclusions for specific loci, an appropriate multiple testing correction is necessary. Finally, with technologies available for the simultaneous assessment of many interrelated biological parameters (such as gene arrays), statistical approaches also need to deal with a possibly unknown dependency structure in the data. Our statistical approach to the nonparametric comparison of two samples with independent multivariate observables is based on recently developed multivariate multiple permutation tests. We adapt their theory in order to cope with families of hypotheses regarding relative effects. Our results indicate that the multivariate multiple permutation test keeps the pre-assigned type I error level for the global null hypothesis. In combination with the closure principle, the family-wise error rate for the simultaneous test of the corresponding locus/parameter-specific null hypotheses can be controlled. In applications we demonstrate that group differences in epigenetic data can be detected reliably with our methodology.
منابع مشابه
Do Not Remove. Power. in Survey Research Applications of the General Linear Model
In this paper we review neglected issues of simultaneous statistical inference and statistical power in survey research applications of the general linear model, and we find that classical hypothesis testing as it is currently applied, is inadequate for the purposes of social research. The intelligent use of statistical inference demands control over the overall level of Type I error and knowle...
متن کاملValid Post-Selection Inference
It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid “post-selection inference” by reducing the problem to...
متن کاملPost - Selection Inference
It is common practice in statistical data analysis to perform datadriven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid “post-selection inference” by reducing the problem to ...
متن کاملPOST - SELECTION INFERENCE By Richard Berk
It is common practice in statistical data analysis to perform datadriven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid “post-selection inference” by reducing the problem to ...
متن کاملAdaptive neuro-fuzzy inference system (ANFIS) applied for spectrophotometric determination of fluoxetine and sertraline in pharmaceutical formulations and biological fluid
The UV-spectrophotometric method of analysis was proposed for simultaneous determination of fluoxetine (FLX) and sertraline (SRT). Considering the strong spectral overlap between UV-Vis spectra of these compounds, a previous separation should be carried out in order to determine them by conventional spectrophotometric techniques. Here, full-spectrum multivariate calibrations adaptive neuro-fuzz...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 10 شماره
صفحات -
تاریخ انتشار 2015